Constrained clusters of gene expression profiles with pathological features
نویسندگان
چکیده
MOTIVATION Gene expression profiles should be useful in distinguishing variations in disease, since they reflect accurately the status of cells. The primary clustering of gene expression reveals the genotypes that are responsible for the proximity of members within each cluster, while further clustering elucidates the pathological features of the individual members of each cluster. However, since the first clustering process and the second classification step, in which the features are associated with clusters, are performed independently, the initial set of clusters may omit genes that are associated with pathologically meaningful features. Therefore, it is important to devise a way of identifying gene expression clusters that are associated with pathological features. RESULTS We present the novel technique of 'itemset constrained clustering' (IC-Clustering), which computes the optimal cluster that maximizes the interclass variance of gene expression between groups, which are divided according to the restriction that only divisions that can be expressed using common features are allowed. This constraint automatically labels each cluster with a set of pathological features which characterize that cluster. When applied to liver cancer datasets, IC-Clustering revealed informative gene expression clusters, which could be annotated with various pathological features, such as 'tumor' and 'man', or 'except tumor' and 'normal liver function'. In contrast, the k-means method overlooked these clusters.
منابع مشابه
Multivariate Feature Extraction for Prediction of Future Gene Expression Profile
Introduction: The features of a cell can be extracted from its gene expression profile. If the gene expression profiles of future descendant cells are predicted, the features of the future cells are also predicted. The objective of this study was to design an artificial neural network to predict gene expression profiles of descendant cells that will be generated by division/differentiation of h...
متن کاملMultivariate Feature Extraction for Prediction of Future Gene Expression Profile
Introduction: The features of a cell can be extracted from its gene expression profile. If the gene expression profiles of future descendant cells are predicted, the features of the future cells are also predicted. The objective of this study was to design an artificial neural network to predict gene expression profiles of descendant cells that will be generated by division/differentiation of h...
متن کاملMesenchymal Stem/Stromal-Like Cells from Diploid and Triploid Human Embryonic Stem Cells Display Different Gene Expression Profiles
Background: Human ESCs-MSCs open a new insight into future cell therapy applications, due to their unique characteristics, including immunomodulatory features, proliferation, and differentiation. Methods: Herein, hESCs-MSCs were characterized by IF technique with CD105 and FIBRONECTIN as markers and FIBRONECTIN, VIMENTIN, CD10, CD105, and CD14 genes using RT-PCR technique. FACS was performed fo...
متن کاملMyeloid Cell Leukemia-1 Gene Expression and Clinicopathological Features in Myelodysplastic Syndrome
Background and Aims: Myeloid cell leukemia-1 (Mcl-1) plays a pivotal role in the survival of hematologic and solid tumors, and is known as a substantial oncogene. Studies have demonstrated the altered expression of Mcl-1 has been linked to malignancy development and poor prognosis. In this research, we have studied the expression of Mcl-1 mRNA in myelodysplastic syndrome (MDS) patients and det...
متن کاملE-cadherin Promoter Methylation Comparison and Correlation with the Pathological Features of the Squamous Cell Carcinoma of Esophagus in the High Risk Region
E-cadherin is among tumor suppressor genes which mostly subjects to the down-regulation in squamous cell carcinoma of esophagus (SCCE). The gene is tightly associated with the tumor invasion and metastasis in multiple human cancers, especially SCCE. CpG islands’ methylation in the promoter region of E-cadherin is among the mechanisms that have been suggested for the E-cadherin silencing, howeve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 20 17 شماره
صفحات -
تاریخ انتشار 2004